CDS

Accession Number TCMCG075C14625
gbkey CDS
Protein Id XP_017974994.1
Location join(30006567..30006712,30006806..30006863,30006940..30007044,30007487..30007544,30008238..30008413,30008754..30008823,30008917..30009212)
Gene LOC18603397
GeneID 18603397
Organism Theobroma cacao

Protein

Length 302aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018119505.1
Definition PREDICTED: probable prolyl 4-hydroxylase 4 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category E
Description prolyl 4-hydroxylase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01252        [VIEW IN KEGG]
KEGG_rclass RC00478        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K00472        [VIEW IN KEGG]
EC 1.14.11.2        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00330        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00330        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0004656        [VIEW IN EMBL-EBI]
GO:0006082        [VIEW IN EMBL-EBI]
GO:0006464        [VIEW IN EMBL-EBI]
GO:0006520        [VIEW IN EMBL-EBI]
GO:0006575        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016491        [VIEW IN EMBL-EBI]
GO:0016705        [VIEW IN EMBL-EBI]
GO:0016706        [VIEW IN EMBL-EBI]
GO:0018126        [VIEW IN EMBL-EBI]
GO:0018193        [VIEW IN EMBL-EBI]
GO:0018208        [VIEW IN EMBL-EBI]
GO:0018401        [VIEW IN EMBL-EBI]
GO:0019471        [VIEW IN EMBL-EBI]
GO:0019511        [VIEW IN EMBL-EBI]
GO:0019538        [VIEW IN EMBL-EBI]
GO:0019752        [VIEW IN EMBL-EBI]
GO:0019798        [VIEW IN EMBL-EBI]
GO:0031543        [VIEW IN EMBL-EBI]
GO:0031545        [VIEW IN EMBL-EBI]
GO:0036211        [VIEW IN EMBL-EBI]
GO:0043170        [VIEW IN EMBL-EBI]
GO:0043412        [VIEW IN EMBL-EBI]
GO:0043436        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044260        [VIEW IN EMBL-EBI]
GO:0044267        [VIEW IN EMBL-EBI]
GO:0044281        [VIEW IN EMBL-EBI]
GO:0051213        [VIEW IN EMBL-EBI]
GO:0055114        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:0140096        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]
GO:1901605        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCTGCTGCGAGGAGTTTTCCTCTCCAATTCGCCATATCGCTCTCAATCGCTTCAATCTTGTACCAATGTTGTGATTCGTTCTTAACTCCTCCAAGCTCCATCATCAATCCTGCTAAAGCCAAACAAGTTTCCTGGAAACCTAGGGCTTTTGTCTATGAAGGCTTCTTAACGGACCTCGAATGCGATCATTTGATCTCTCTCGCGAAATCGGAGCTAAAGAGATCTGCAGTTGCTGATAATGTTAGTGGAAAGAGCAGGCTTAGCGAAGTCCGTACGAGCTCAGGAATGTTTATATCTAAGGGAAAGGATCCTATTGTTGCTGGTATAGAGGACAAGATTTCAACATGGACATTTCTTCCCAAAGAAAATGGGGAAGACATACAAGTGTTGAGATATGAGCATGGACAGAAATATGATCCACACTACGACTACTTTGTCGACAAGGTGAATATTGCCAGGGGTGGACACCGTATAGCAACTGTGCTGATGTATCTTACAGATGTGACCAAAGGTGGTGAAACAGTATTCCCCCAAGCAGAGGAATCTTCACGTCGTAAGACTCCTGCAACAGATGATGACCTCTCAGAATGTGCAAAGAAGGGAATTGCAGTGAAACCACGAAGAGGAGATGCCCTTCTCTTCTTCAGTCTCTCCCCAACTGCTATACCTGACCCAAGCAGTCTGCATGCTGGGTGCCCAGTGATTGAAGGTGAGAAATGGTCGGCAACAAAGTGGATTCATGTTGATTCTTTTGACAAGAATTTGGAAGCCGGTGGCAACTGCACAGATTTGAATGAGAGTTGTGAGAGATGGGCTGCTCTTGGTGAGTGCTCGAAGAACCCAGAGTATATGATTGGATCTGCAGCGCTTCCTGGCTATTGTAGGAGAAGCTGTAAAGTATGTTAG
Protein:  
MAAARSFPLQFAISLSIASILYQCCDSFLTPPSSIINPAKAKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFISKGKDPIVAGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDPHYDYFVDKVNIARGGHRIATVLMYLTDVTKGGETVFPQAEESSRRKTPATDDDLSECAKKGIAVKPRRGDALLFFSLSPTAIPDPSSLHAGCPVIEGEKWSATKWIHVDSFDKNLEAGGNCTDLNESCERWAALGECSKNPEYMIGSAALPGYCRRSCKVC